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Four studies were conducted to investigate the source of reinforcement of Sidman avoidance. 
First it was found that the acquisition of avoidance was seriously impaired if there was no 
immediate stimulus consequence of responding, i.e., proprioceptive and auditory feedback. 
The second study showed that if the shock-shock interval was split so that only responses 
occurring in one half of the interval were effective in postponing shock, then learning was 
impaired. It made little difference which half of the interval was used. The third study 
demonstrated that Ss learn Sidman avoidance more quickly with a variable shock-shock inter- 
val than with the usual fixed-interval procedure. The results of the second and third experi- 
ments argue against the view that avoidance is reinforced by the decrease in conditioned 
aversiveness that occurs at long post-shock times. The final experiment indicated that Ss do not 
learn Sidman avoidance if the response-produced delay of shock is preceded by a shock, hence 
it seems unlikely that the crucial source of reinforcement is merely an overall reduction in 
shock density. All of the findings are consistent with the hypothesis that the avoidance response 
is reinforced by the decrease in conditioned aversiveness of stimuli at short post-response 
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times. This seems to be the case even at the beginning of acquisition. 


The importance of the warning stimulus in 
classical avoidance learning has been amply 
demonstrated by Kamin (1956) and others. 
Thus, whether the warning stimulus is re- 
garded as a conditioned negative reinforcer or 
as an elicitor of fear, the reinforcement of the 
avoidance response in the classical or discrim- 
inative avoidance situation may be readily at- 
tributed to the termination of the warning 
stimulus. But Sidman (1953) showed that rats 
could learn to press a bar to avoid (postpone) 
an electric shock even though there was no 
environmental warning stimulus, the termina- 
tion of which could be said to reinforce the 
response. Moreover with Sidman’s procedure 
it is not possible for the avoidance response 
merely to be a generalization of a previously 
acquired escape response; the bar press is 
never permitted to terminate shock but only 
to postpone the next scheduled shock. The 
question then is what reinforces bar pressing 
in Sidman’s situation? 

Sidman’s own first proposal (1953) followed 
the general lines of the argument Schoenfeld 
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(1950) had advanced to account for classical 
avoidance learning. According to Schoenfeld 
and Sidman (and subsequently Dinsmoor, 
1954), any response other than the avoidance 
response is likely sooner or later to occur in 
the presence of shock, at which time the pro- 
prioceptive, tactile and external stimuli result- 
ing from such non-avoidance behavior will be- 
come conditioned negative reinforcers. ‘Their 
termination will reinforce any response which 
interrupts them. As alternative responses be- 
come punished they become gradually sup- 
pressed until S has no response available ex- 
cept bar pressing. One serious difficulty with 
this interpretation is that it would seem to re- 
quire that the avoidance response be acquired 
rather gradually; in fact, it is commonly found 
that the response increases in strength precipi- 
tously, and frequently quite early in acquisi- 
tion training. Moreover, once the behavior is 
established it is usually found to be exceed- 
ingly resistant to extinction. 

More recently, Sidman (1962) has argued 
that the avoidance response may be reinforced 
by the reduction in shock density that it effects 
(S receives fewer shocks per unit time when it 
presses the bar than when it does not). 
Although a reinforcement mechanism based 
upon shock density may be effective early in 
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acquisition, when a few responses can mate- 
rially reduce shock density, Anger (1963) has 
pointed out that it cannot account for the con- 
tinued improvement in performance long 
after the shock density has been reduced to 
values lower than the rat can be reasonably 
assumed to discriminate. Some other source of 
reinforcement must be sought. 

The most recent and promising interpreta- 
tion of Sidman avoidance has been made by 
Anger (1963). He argues that the aversive 
stimuli, termination of which reinforces the 
avoidance response, are temporal in character. 
And just as there are two temporal payameters 
in a Sidman schedule so there are two classes 
of temporal stimuli that can be involved. One 
is the set of stimulus changes that follow a 
shock. The subject is safe immediately after a 
shock but whatever temporal stimuli S has 
should increase in conditioned aversiveness 
during the interval (the S-S interval) as the 
time for the next shock approaches. When the 
avoidance response occurs S encounters novel 
post-shock-time stimuli (PST stimuli) which 
can only be aversive by generalization since 
they have never been paired with shock. The 
avoidance response is reinforced by the reduc- 
tion in aversiveness it produces. 

However, Anger (1963) stresses the impor- 
tance of the second class of temporal stimuli 
and a reinforcement mechanism originally 
suggested by Mowrer and Keehn (1958). 
Stripped of Mowrer and Keehn’s concepts of 
fear, drive and perseverative traces it is this: 
whatever stimuli follow the avoidance re- 
sponse, they are aversive only by generaliza- 
tion from those present at the time of shock. 
Stimuli that come later, toward the end of the 
postponement interval (the R-S interval), are 
more like those present at the time of shock 
and hence are more aversive. Making the 
avoidance response removes the stimuli of long 
post-response-time (PRT) and_ reintroduces 
those of short PRT. The reduction in condi- 
tioned aversiveness reinforces the response. 
This hypothetical reinforcement mechanism 
makes it possible to account for the more or 
less regular pacing of avoidance responses that 
occurs when the behavior is so well established 
that few shocks are received. 

The purpose of the present study is to dem- 
onstrate that the temporal stimuli contingent 
upon responding also account for the early 
acquisition of the behavior. We will show that 
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itis PRT stimuli rather than PST stimuli that 
are crucial in the initial acquisition of Sidman 
avoidance. 


EXPERIMENT I 


In proposing that it is the conditioned aver- 
siveness of response-produced stimuli that are 
involved in reinforcing the avoidance re- 
sponse, Anger (1963) has not speculated about 
the locus or nature of such response-produced 
stimuli. They might be internal or external. 
The response time hypothesis requires only 
that there be some stimulus consequence of 
the response. The precise specification of the 
stimulus is not crucial so long as it can be as- 
sumed that there are some stimulus conse- 
quences of responding and that these stimulus 
consequences change with PRT. The purpose 
of Experiment I is to show that if the experi- 
mental situation is arranged so as to minimize 
the stimulus consequences of the avoidance 
response then it is not likely to be learned. 

Subjects. Twenty naive male albino rats of 
the Sprague-Dawley strain, approximately 3 
months old, were used. 

Apparatus. A Foringer 1102 test chamber 
was equipped with a modified operandum. 
The bar was 21% in. long, 1% in. in diameter, 
and protruded 2 in. into the box. Approxi- 
mately 10 g of force were required to move it. 
A bar depression of 1/16 in. closed a silent 
wiping switch; the action of the bar was 
damped so that there was no “feel” or sound 
when it was pressed or released. It was located 
4 in. above the grid. Recording and program- 
ming apparatus was located in another room. 
Thus, the sensory consequences of responding 
could be reduced to a minimum. Shocks were 
delivered from a high voltage dc constant-cur- 
rent source of 1.1 ma through a Lehigh Valley 
grid scrambler. 

Procedure. Shocks of .30-sec duration were 
presented at 5-sec intervals (the S-S interval) 
unless a bar press occurred, in which case the 
next shock came 15 sec after the last response 
(the R-S interval). A response during the R-S 
interval initiated a new R-S interval. 

Half of the Ss were run under “silent” con- 
ditions, 1.¢e., those just described. The other 
half were run under the usual conditions 
where immediate sensory feedback of respond- 
ing was provided; for these Ss an empty Davis 
pellet dispenser was activated with each re- 
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sponse. Subjects were run for 2 hr sessions, 
generally on successive days. 


Results 


Preliminary Considerations. Typically, the 
operant rate of bar pressing is initially rela- 
tively high and then drops off as the session 
continues. There is individual variation in 
operant rate which reflects primarily variation 
in gross body activity under conditions of peri- 
odic shock. Animals that flinch, freeze or de- 
velop stereotyped running reactions to the 
shock generally do not learn the avoidance re- 
sponse; those that explore, rear and attempt to 
climb the walls do learn. These gross activities 
raise the operant level of bar pressing.’ 

Effective avoidance does not, however, 
emerge gradually from the operant level; al- 
most invariably there is a clearly specifiable 
point at which the rate of responding sud- 
denly increases and the shock density suddenly 
decreases. The local rate of responding in this 
“acquisition burst” is usually at least as high 
as is found in well-established avoidance re- 
sponding. It represents a sharp transition from 
rates near 25 per hr to rates in the order of 
1000 per hr. Thus, there is rarely any question 
about whether S learns the avoidance response 
or at what point the learning becomes effec- 
tive. Since the present studies are concerned 
with the parameters that reinforce avoidance 
early in acquisition, the primary concern will 
be with the effect of various experimental con- 
ditions upon the location of this acquisition 
burst, both in terms of the number of re- 
sponses that precede it and how soon it occurs. 
The acquisition burst usually occurs either in 
the first or second hour of training or not at 
all; of the 27 Ss that learned the avoidance re- 
sponse in the four present experiments 26 
showed the acquisition burst in the first 2-hr 
session, and only one in the second session. 
The occurrence of the acquisition burst seems 
to require a relatively high operant rate of bar 
pressing, although, as Fig. 1 indicates, there 
are exceptions to this rule. It is as though $ 
must make a number of responses in the 
“operant phase” so that long PRT’s can be- 
come aversive. This is what the PRT hypothe- 


‘The “operant rate” referred to here is something 
of a misnomer since, presumably, the avoidance re- 
sponse is reinforced whenever it occurs. The phrase 
here refers to the rate of responding in the absence of 
any behavioral evidence of reinforcement. 
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sis requires, and generally this is the way it 
seems to be. But there are some cases where 
the acquisition burst occurs before this can 
happen. 

Some Ss learn to freeze while holding on to 
the bar. This behavior occurs under normal 
conditions rather frequently; it is a phase that 
may last from a few up to 30 min, although it 
may be passed over entirely. This pseudo- 
avoidance phase occurs when S freezes on the 
bar so that no responses occur during the R-S 
interval. When the shock comes on S is jolted 
and a bar press is registered. Such pseudo- 
avoidance behavior is also easy to detect in the 
cumulative records since it results in a some- 
what lower rate of pressing than that found in 
effective avoidance, and a shock density that is 
much higher and much more locally stable 
than that achieved by Ss that learn to respond 
during the R-S interval. This sort of behavior 
may be reinforced by the mechanism Sidman 
(1962) has suggested, t.e., by the reduction in 
shock density it produces. But more likely it is 
an artifact of the particular time intervals that 
were used here. Thus, Ss will freeze on the bar 
when shocks recur every 15 or 20 sec but not 
when they recur every 5 sec; all too frequently 
they freeze crouched in the corner when shocks 
are presented at 5-sec intervals, but not stand- 
ing by the bar. The more rapid shocks elicit 
too much competing behavior. 

The effect of sensory feedback. Seven of the 
10 Ss trained under normal conditions, 1.e., 
with the click of the feeder mechanism occur- 
ring with every response, learned the avoidance 
response. Only two of the 10 Ss trained under 
the silent conditions, t.e., with a minimum of 
response feedback, learned in the two sessions. 
The cumulative response records for all Ss are 
given in Fig. I. 

The four non-learners with the highest oper- 
ant rates of pressing were run additional ses- 
sions; their rates dropped to | per hr or less. 
Subjects with response feedback achieved the 
acquisition burst sooner and on the basis of 
fewer prior responses than Ss with no feed- 
back. The medians under the normal condi- 
tions were 12 min and 20 responses as against 
indeterminant medians larger than 240 min 
and 35 responses under the silent condition. 
Counting just learners, the differences are still 
in favor of more rapid onset of the acquisition 
burst for the normal, 2.e., click, Ss. The two Ss 
that did learn with minimal feedback, as well 
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Fig. 1. Cumulative response records during the first 
training session of rats given a click for response feed- 
back (upper records) or no click to minimize response 
feedback (lower records). 


as other animals observed under these condi- 
tions, went through a prolonged pseudo-avoid- 
ance phase of freezing at the bar. Those learn- 
ing with feedback showed shock density curves 
which, by the end of the 2-hr session, were still 
dropping. It would seem then that the intro- 
duction of a novel stimulus, such as the feeder 
click, which is contingent upon the avoidance 
response, markedly facilitates the acquisition 
of the response. 


EXPERIMENT II 


Anger (1963) has proposed that as time 
elapses after the last shock, the total stimulus 
situation becomes more aversive, more like 
that which exists at the time of shock. The 
conditioned aversiveness should be a mono- 
tonic increasing function of elapsed time in 
the S-S interval, since stimuli following shortly 
after a shock have themselves never been 
paired with shock and so can be aversive only 
by generalization. When a response occurs in 
the S-S interval, the animal soon encounters 
novel temporal stimuli which, because of their 
novelty, can only have generalized aversive- 
ness. It is, according to the PST hypothesis, the 
transition from the relatively highly aversive 
preshock stimuli to these stimuli at novel post- 
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shock times that provides the initial reinforce- 
ment of the avoidance response. 

Responses made early in the S-S interval 
should not be so effectively reinforced as those 
made late in the interval. This follows partly 
from the continued increase in conditioned 
aversiveness that continues through the S-S in- 
terval, and partly because they are more re- 
moved in time from the eventual drop in aver- 
siveness. This implication was tested in the 
present experiment by running two groups of 
Ss. For one, only responses made in the first 
half of the S-S interval were effective in post- 
poning shock; for the second, only responses 
made in the second half were effective. It 
would be predicted on the basis of the PST 
hypothesis the second-half group should learn 
Sidman avoidance more readily than the first- 
half group. 

Subjects. Seventeen naive male Sprague- 
Dawley rats, 3 to 4 months old, were used. 

Apparatus. The same as in Experiment I. 

Procedure. For the nine Ss in group 1, only 
responses made in the first half of the 5-sec S-S 
interval were effective in postponing the shock 
and only those responses produced the feed- 
back click. Responses in the second half were 
recorded but were of no consequence to S; they 
neither produced the feedback click nor post- 
poned shock. For the eight Ss in group 2 only 
bar presses made in the second half of the S-S 
interval were effective in producing feedback 
and in postponing shocks, and those in the 
first half were of no consequence to the ani- 
mal. For both groups, however, all responses 
outside the S-S interval, 2.e., all those made in 
the R-S interval, were effective in providing 
further postponement of shock, and all such 
responses produced the feedback click. Hence, 
under these conditions, once avoidance be- 
came established it occurred under conditions 
that approached normal for both groups, but 
the conditions under which the initial acquisi- 
tion burst occurred differed markedly between 
groups. 


Results 


Two of the nine Ss in group | learned the 
avoidance response, although in one of these 
(Rat #1-8) the behavior was of the pseudo- 
avoidance variety which lasted through three 
sessions. I'wo of the eight Ss in group 2 also 
learned the bar press response, and neither of 
them went through a pseudo-avoidance phase 
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(because responses immediately after a shock 
were not effective). 

The distribution of responses in the two 
halves of the S-S interval and the R-S interval 
is shown in Table 1 for each S for the first ses- 
sion. Subjects are ranked in the table accord- 
ing to the total number of responses. Note that 
there is very little difference in distribution of 
responses between the two groups, and that for 
all non-learners the preponderance of re- 
sponses occurred shortly after a shock when 
gross body activity was high. This is also true 
of the majority of responses in the R-S inter- 
val; they occurred as infrequent bursts of re- 
sponses that initiated an R-S interval). The 
only difference in pattern of responses is found 
in Ss that learn. There the distribution of re- 
sponses follows the reinforcement contingen- 
cies. Rat #1-9, cumulative records for which 
are shown in Fig. 2, displayed the most grad- 
ual acquisition of genuine avoidance of any S. 
There was no well-defined acquisition burst, 
although there is a sharp drop in shock density 
at point A, after 400 responses had been cumu- 
lated. 

The conjectures regarding the importance 
of temporal discrimination within the S-S in- 
terval are not substantiated by these results. 
There is no overall increment in responding 
in group 2 that the hypothetical greater rein- 
forcement from second-half responding is sup- 
posed to provide. Actually, the total number 
of responses in the first session was somewhat 
higher among non-learners than it was among 
non-learners in Experiment I where all re- 
sponses in S-S were supposedly reinforced. On 
the other hand, it could be argued that a com- 
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Fig. 2. Cumulative responses (upper record) and cum- 
ulative shocks (lower record) during the first training 
session of rat #1-9 which displayed unusually gradual 
acquisition of Sidman avoidance. 


parable number of responses were generated 
by far fewer reinforcements in group 2 than in 
group | (median of 13 as against 38). ‘There 
remains the puzzle of why acquisition in both 
groups was so poor. 


EXPERIMENT III 


The results of the previous experiment sug- 
gest that discrimination of temporal stimuli in 
the S-S interval are relatively unimportant in 
Sidman avoidance. It is conceivable that the 
PRT hypothesis is applicable to the acquisi- 
tion of avoidance right at the outset. To test 


Table 1 


The Distribution of Responses in the First and Second Half of the S-S Interval and in the R-S 
Interval by Rats for which Responding only in the First Half or the Second Half is Reinforced. 


Group 1 
R’s in §-S 

Rat First Second R’sin 
# Half Half R-S 
1-1 22 4 0 
1-2 21 2 4 
1-3 27 l 5 
1-4 24 6 4 
1-5 34 2 4 
1-6 37 3 l 
1-7 39 4 3 
1-8 689 30 333 
1-9 664 63 498 
Medians 34 4 4 


Group 2 
R’s in §-S 
Rat First Second R’sin 
# Half Half R-S 
2-1 25 6 0 
2-2 20 9 5 
2-3 31 3 1 
2-4 31 8 4 
2-5 37 8 2 
2-6 64 17 7 
2-7 523 637 451 
2-8 407 922 714 
34 8.5 4.5 
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this possibility two groups of Ss were run, one 
under the usual conditions with S-S interval 
fixed, and the other with a variable S-S inter- 
val. Under the variable S-S condition there 
should be little or no opportunity for the 
hypothesized PST mechanism to be effective. 
If PST discrimination does in fact play a part 
in the initial acquisition of avoidance behav- 
ior, then learning should be impaired. On the 
other hand, any learning which results in this 
situation should be the result of differential 
aversiveness of early and late PRT stimuli. 

Subjects. Fifteen naive albino female rats of 
the Dublin Disease Resistant (DDR) strain, 2 
to 3 months old, were used. 

Apparatus. The same used in the preceding 
experiments except that the S-S interval was 
governed by a tape programmer. 

Procedure. The seven Ss under the FI con- 
dition were presented with shocks every 5 sec 
as long as no bar press occurred. A bar press 
stopped the S-S programmer and delayed the 
next shock for 15 sec; a response during the 
R-S interval initiated a new R-S interval. 
Eight Ss of the VI group were presented shocks 
on a 5-sec VI schedule (6, 8, 2, 7, 2, 5, 7, 1, 9, 6, 
3, 1, 9, 8, 4, 3, 5, 4 sec). All responses produced 
the feedback click. 


Results 


Five of the 8 Ss in the FI group learned the 
response in the first test session. ‘This propor- 
tion is comparable to the 7 out of 10 Ss that 
learned under the same conditions in Experi- 
ment I. The striking finding here is that all Ss 
in the VI group learned the avoidance re- 
sponse. Comparing Ss in this group with just 
the learners of the FI group, their perform- 
ance is still superior. They made more re- 
sponses during the first session, received fewer 
shocks, showed a shorter latency to the acquisi- 
tion burst and achieved this burst on the basis 
of fewer responses. 

It would seem that the regular pacing of 
shocks under the FI S-S condition interferes 
with avoidance learning rather than providing 
the basis for it. Direct observation of Ss reveals 
that they are considerably more active under 
the VI condition than under the FI condition. 
Evidently the regular pacing of shocks at a 
fixed S-S interval permits S$ to learn postural 
adjustments such as freezing and perhaps 
other unauthorized avoidant behaviors which 
compete with bar pressing. Whatever the de- 
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tails of this competing behavior, it is clear that 
the avoidance response is not principally rein- 
forced, if at all, by the interruption of a tem- 
poral pattern of shocks. 


EXPERIMENT IV 


Sidman (1962) has suggested that an impor- 
tant variable in avoidance is the reduction of 
shock density. The animal is supposed to learn 
to discriminate the difference between shock 
and no shock, and the reduction of the density 
of shock provides reinforcement of the re- 
sponse that produces it. In the present experi- 
ment this interpretation was tested against the 
PRT hypothesis. The experimental situation 
was arranged so that after an avoidance re- 
sponse in the S-S interval that S-S interval was 
completed, the scheduled shock was presented, 
and then the R-S interval was interposed. Re- 
sponses made during the R-S interval were 
then effective in providing further postpone- 
ment of shock. This procedure permits a de- 
crease in the shock density but isolates the 
change in density from the response by a shock 
delivered on schedule. According to the shock 
density hypothesis, learning should occur un- 
der these conditions. However, if the PRT 
hypothesis provides the correct interpretation, 
then learning should be virtually impossible 
under these conditions. 

Subjects. Fourteen naive albino female rats 
of the DDR strain, 3 months old, were used. 

Apparatus. The same apparatus was used. 

Procedure. As before, the shock was pre- 
sented at 5-sec intervals in the absence of a bar 
pressing. When a response occurred the next 
scheduled shock was delivered but the appear- 
ance of the following shock was delayed for 
the R-S interval. Responses in the R-S interval 
provided further postponements of shock. 
Two groups were run, one with a 15-sec R-S 
interval and the other with a 45-sec R-S inter- 
val. 


Results 


With the 15-sec delay condition no S learned 
the avoidance response. With the 45-sec delay, 
one S learned. The behavior in this one case, it 
should be noted, was exceedingly persistent 
and precise pseudo-avoidance. 

In conclusion it would seem that the stimuli 
that follow a response and increase in aversive- 
ness with the passage of time since the re- 
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sponse (PRT stimuli) not only play a crucial 
role in the ultimate form Sidman avoidance 
behavior takes after prolonged training, but 
are also important in the initial acquisition of 
this behavior. There seems to be little or no 
avoidance learning unless the response results 
in some immediate stimulus consequence for 
the animal (Experiment I). The generalized 
components of this stimulation (or its traces) 
increase progressively in aversiveness during 
the R-S interval so that S is reinforced for 
abolishing these aversive components (or re- 
setting the trace). If low aversiveness of short 
PRT stimuli is precluded by the experimental 
conditions (Experiment IV), then learning to 
respond during the R-S interval does not oc- 
cur. The analogous temporal gradient of aver- 
siveness arising from the stimulus conse- 
quences of the shock itself (PST stimuli) seems 
to play a minor role in the acquisition of the 
behavior (Experiments II and ITI). 
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